arXiv: 1506.08319vl [cs.DS] 27Jun2015 


Greedy Is an Almost Optimal Deque 


Parinya Chalermsook 1 , Mayank Goswami 1 , Laszlo Kozma 2 , Kurt Mehlhorn 1 , and 

Thatchaphol Saranurak 3 * 

1 Max-Planck Institute for Informatics, Saarbriicken, Germany 66123 
2 Department of Computer Science, Saarland University, Saarbriicken, Germany 66123 
3 KTH Royal Institute of Technology, Stockholm, Sweden 11428 

Abstract. In this paper we extend the geometric binary search tree (BST) model 
of Demaine, Harmon, Iacono, Kane, and Patragcu (DHIKP) to accommodate 
for insertions and deletions. Within this extended model, we study the online 
Greedy BST algorithm introduced by DHIKP. Greedy BST is known to be equiv¬ 
alent to a maximally greedy (but inherently offline) algorithm introduced inde¬ 
pendently by Lucas in 1988 and Munro in 2000, conjectured to be dynamically 
optimal. 

With the application of forbidden-submatrix theory, we prove a quasilinear upper 
bound on the performance of Greedy BST on deque sequences. It has been con¬ 
jectured (Tarjan, 1985) that splay trees (Sleator and Tarjan, 1983) can serve such 
sequences in linear time. Currently neither splay trees, nor other general-purpose 
BST algorithms are known to fulfill this requirement. As a special case, we show 
that Greedy BST can serve output-restricted deque sequences in linear time. A 
similar result is known for splay trees (Tarjan, 1985; Elmasry, 2004). 

As a further application of the insert-delete model, we give a simple proof that, 
given a set U of permutations of [/?], the access cost of any BST algorithm is 
Q(\og\U\ + n ) on “most” of the permutations from 17. In particular, this implies 
that the access cost for a random permutation of [/z] is Q(n log n) with high prob¬ 
ability. 

Besides the splay tree noted before. Greedy BST has recently emerged as a plau¬ 
sible candidate for dynamic optimality. Compared to splay trees, much less effort 
has gone into analyzing Greedy BST. Our work is intended as a step towards a 
full understanding of Greedy BST, and we remark that forbidden-submatrix ar¬ 
guments seem particularly well suited for carrying out this program. 


1 Introduction 

Binary search trees (BST) are among the most popular and most thoroughly studied 
data structures for the dictionary problem. There remain however, several outstanding 
open questions related to the BST model. In particular, what is the best way to adapt 
a BST in an online fashion, in reaction to a sequence of operations (e.g. access, insert, 
and delete), and what are the theoretical limits of such an adaptation? Does there exist 
a “one-size-fits-all” BST algorithm, asymptotically as efficient as any other dynamic 
BST algorithm, regardless of the input sequence? 

Splay trees have been proposed by Sleator and Tarjan as an efficient BST al¬ 
gorithm, and were shown to be competitive with any static BST (besides a number of 
other attractive properties, such as the balance , working set , and static finger proper¬ 
ties). Furthermore, Sleator and Tarjan conjectured splay trees to be competitive with 
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any dynamic BST algorithm; this is the famous dynamic optimality conjecture G3- 
An easier, but similarly unresolved, question asks whether such a dynamically optimal 
algorithm exists at all. We refer to |7j for a survey of work related to the conjecture. 

A different BST algorithm (later called GreedyFuture) has been proposed indepen¬ 
dently by Lucas |8j and by Munro |9||. GreedyFuture is an offline algorithm: it antic¬ 
ipates future accesses, preparing for them according to a greedy strategy. In a break¬ 
through result, Demaine, Harmon, Iacono, Kane, and Patra§cu (DHIKP) transformed 
GreedyFuture into an online algorithm (called here Greedy BST), and presented a ge¬ 
ometric view of BST that facilitates the analysis of access costs (while abstracting away 
many details of the BST model). 

At present, our understanding of both splay trees and Greedy BST is incomplete. 
For splay trees, besides the above-mentioned four properties (essentially subsumetQby 
a single statement called the access lemma), a few other corollaries of dynamic optimal¬ 
ity have been shown, including the sequential access jT5j] and the dynamic finger 00 
theorems. The only known proof of the latter result uses very sophisticated arguments, 
which makes one pessimistic about the possibility of proving even stronger statements. 

A further property conjectured for splay trees is a linear cost on deque sequences 
(stated as the “deque conjecture” by Tarjan 1151 in 1985). Informally, a deque sequence 
consists of insert and delete operations at minimum or maximum elements of the current 
dictionary. Upper bounds for the cost of splay on a sequence of n deque operations are 
0(na(n)) by Sundar 03 and 0(na*(n)) by Pettie Here a is the extremely slowly 
growing inverse Ackermann function, and a* is its iterated version. A linear bound 
for splay trees on output-restricted deque sequences (i.e. where deletes occur only at 
minima) has been shown by Tarjan 05). and later im P roved by Elmasry (5). 

In general, our understanding of Greedy BST is even more limited. Fox |J6]| has 
shown that Greedy BST satisfies the access lemma and the sequential access theorem, 
but no other nontrivial bounds appear to be known. One might optimistically ascribe 
this to a (relative) lack of trying, rather than to insurmountable technical obstacles. This 
motivates our attempt at the deque conjecture for Greedy BST. 

As mentioned earlier, a deque sequence consists of insert and delete operations. In 
the tree-view, e.g. for splay trees, such operations have a straightforward implementa¬ 
tion. Unfortunately, the geometric view in which Greedy BST can be most naturally 
expressed only concerns with accesses. Thus, prior to our work there was no way to 
formulate the deque conjecture in a managable way for Greedy BST. 


Our contributions. We augment the geometric model of DHIKP to allow insert and 
delete operations (exemplified by the extension of the Greedy BST algorithm), and 
we show the offline and online equivalence of a sequence of operations in geometric 
view with the corresponding sequence in tree-view. This extended model allows us to 
formulate the deque conjecture for Greedy BST. We transcribe the geometric view of 
Greedy BST in matrix form, and we apply the forbidden-submatrix technique to derive 
the quasilinear bound 0(>n2 ,r< '"- m+n) + n) on the cost of Greedy BST, while serving a 
deque sequence of length m on keys from [«]. 

We also prove an 0(m + n ) upper bound for the special case of output-restricted 
deque sequences. We find this proof considerably simpler than the corresponding proofs 

4 Apart from a technicality for working set, that poses no problem in the case of splay trees 
and Greedy BST. 
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for splay trees, and we observe that a slight modification of the argument gives a new 
(and perhaps simpler) proof of the sequential access theorem for Greedy BST. 

As a further application of the insert-delete model we show through a reduction 
to sorting that for any BST algorithm, most representatives from a set U of permu¬ 
tations on [n] have an access cost of Oflog \U\ + n ). In particular, this implies that a 
random permutation of \n] has access cost Q{n log n) with high probability. A similar 
result has been shown by Wilber 03 for random access sequences (that might not be 
permutations). Our proof is self-contained, not relying on Wilber’s BST lower bound. 
Permutation access sequences are important, since it is known that the existence of a 
BST algorithm that is constant-competitive on permutations implies the existence of a 
dynamically optimal algorithm (on arbitrary access sequences). 


Related work. A linear cost for deque sequences is achieved by the multi-splay algo¬ 
rithm j4], Thm 3] in the special case when the initial tree is empty; by contrast, the 
results in this paper make no assumption on the initial tree. 

Most relevant to our work is the deque bound of Pettie for splay trees [10]. That 
result relies on bounds for Davenport-Schinzel sequences, which can be reformulated 
in the forbidden-submatrix framework. Indeed, the use of forbidden-submatrix theory 
for proving data structure bounds was pioneered by Pettie, who reproved the sequential 
access theorem for splay trees GD (among other data structure results). Our application 
of forbidden-submatrix theory is somewhat simpler and perhaps more intuitive: the 
geometric view of Greedy BST seems particularly suitable for these types of arguments, 
as the structure of BST accesses is readily available in a matrix form, without the need 
for an extra “transcribing” step. 


2 Geometric Formulation of BST with Insertion/Deletion 

In this section we extend the model of DHIKP [3j to allow for insertions and deletions. 
After defining our geometric model, we prove the equivalence of the arboreal (i.e. tree- 
view) and the geometric views of BSTs. 

2.1 Rotations and Updates 

Definition 1 (Valid Reconfiguration). Given a BST T\, a (connected) subtree t of T\ 
containing the root, and a tree t' on the same nodes as t, except that one node may be 
missing or newly added, we say that T\ can be reconfigured by an operation t —> r' to 
another BST 73 if T 2 is identical to T\ except for t being replaced by t' , meaning that 
the child pointers of elements not in t do not change. The cost of the reconfiguration is 

max{[r[, |t'|). 

This definition differs from Def.2] in that t' need not be defined on the same nodes 
as r. Note that, according to the definition, if an operation t —» r' changes a child 
pointer of an element x, then x e r. See Figure[T]for examples. 

Definition 2 (Execution of Update Sequence). Given an update sequence 

S = ((si, op^, (S 2 , op 2 ),..., (s m , op,,,)), where op,- e {access, insert, delete), 

we say that a BST algorithm executes S by an execution E = (To, r 1 —> t' v ... ,t„, —> 
r',) if all reconfigurations r f —> t\ transforming T t -\ to T, are valid, and for all t 
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- if op, = access, then s, e r, and t\ — t, as a set, 

- if op, = insert, then t\ — {s,)Ur, as a set, 

- if op, = delete, then t, = {s,}Ur' as a set. 

We also say that E executes S. The cost of execution of E is the sum over all recon¬ 
figuration costs. If an element x e t, U t\, we say that x is touched at time t. 



Fig. 1: (left) Examples of valid insert/delete operations. Circled elements indicate r and r'; (right) 
Examples of invalid operations: r does not contain root (above) and r' cannot link all 
pendant trees (below). 


We assume that we work over the set [n]. Each element can be inserted or deleted 
many times, but insertions and deletions on the same element must be alternating. We 
also assume that every element is accessed or updated at least once. 

2.2 Valid Sets 

Definition 3 (Geometric View of Update Sequence). The geometric view of an up¬ 
date sequence S is a point set P(S ) = A(S)Ol(S)OD(S ) in the integer grid [n] x [m] 
consisting of access points A(S) — {(s t ,t) | op, = access), insertion points I(S) = 
{( s,,t ) | op, = insert), and deletion points D(S ) = {(s,,t) | op, = delete). Update 
points are U(S) = I(S)(JD(S). 

We usually omit the parameter S and simply write A, /, D, U when the choice of S is 
clear from context. We denote the v-coordinate and f-coordinate of a point p by (p x , p,). 
By element x, we mean the column x. By time t, we mean the row t. 

Definition 4 (Valid Point). Given a point set P(S) in the integer grid [n] X [m], let p be 
a point (p may not be in P(S)), and let p',p" € U(S) denote the update points nearest 
to p, below (resp. above) p, i.e. p' x — p” = p x , and p', < p, < p". One or both of p' and 
p" might not exist. We say that p is valid in P(S), iff: 

-pi U(S), p' e I(S) (or does not exist), and p" e D(S ) (or does not exist), or 

- p e I(S), p' e D(S) (or does not exist), and p" e D(S ) (or does not exist), or 

- p e D(S), p' e I(S ) (or does not exist), and p" e I(S ) (or does not exist). 
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Let T, denote the resulting tree at time t during an execution of the BST algorithm 
E on the update sequence S. Observe that Definition [4] allows elements to be accessed 
or deleted without having been inserted before. Such elements are (implicitly) in the 
initial tree Tq. 

Fact 5 A point x can be touched at time t iff (x, t) is valid. 

Suppose that (x, t ) is valid. If (x, t ) is a deletion point, then x is in T, \ but not 7), 
and it is touched. If (x, t) is an insertion point, then x is in T, but not T, \ , and it is 
touched. If (x, t) is not an update point, then x is in both trees, and might or might not 
be touched. See Figure[2]for an illustration. 

Definition 6 (Predecessor/Successor of a Point). Given P(S ), the predecessor pred( p) 
of a point p is the largest element x' smaller than p x such that (x J , p,) is valid. We also 
write predip) = ( x J , p,) as a point. The successor succ(p) of p is symmetrically defined. 

Definition 7 (Valid Set). A point set P 2 PiS) is valid iff every point p e P is valid. 

For any node x in a tree T, let pred T (x) and 
succt(x) denote the predecessor, respectively suc¬ 
cessor of x in T. The following lemma shows that 
points in a valid set, and their predecessor and suc¬ 
cessor, are associated with nodes in the tree at the 
corresponding time. 

Lemma 8. Let P 2 P(S ) be a valid point set, 
and E executes S. For any p e U(S), we have 
pred(p) = pred Tp (p x ) and succ(p) = succr Pl ip x )• 

Proof. Let x' = pred(p) and hence ix', p,) is valid 
by definition. By Fact[5] x' can be touched at time 
p,. Since x' is not an updated element, we have 
Fig. 2: A point set with insert (O) and delete (x) points. Dashed x 6 T Pr Moreover, x is the closest element on 
lines indicate valid points. Observe that succ(x) = v 3 , the left of p x at this time. So x — pred T ^ (p x ). 
succiy) = v 2 , and pred(x) = pred(y) = v ,. The proof for successor is symmetric. □ 

Definition 9 (Active Time of Points). Let p be a point in a valid point set P 2 P(S). 
The active time act(p) of p is the maximal consecutive interx’al of time [ti„ s ip),tdeiip )] 
containing p, such that, for all t 6 act(p), (p x , t ) is valid. We call ti ns (p) insertion time 
of p, and tdelip) deletion time of p. 

2.3 Arboreally Satisfied Set 

Definition 10 (Geometric View of BST Execution). The geometric view of a BST 
execution E — (To, T\ —> t' ,..., r„, —> T , m ) of some update sequence S is the point set 
PiE) — {(x, t) | x e t, U tJ) in the integer grid, indicating which element is touched at 
which time. Note that PiE) 2 PiS ). 

Definition 11 (Arboreally Satisfied Set). A valid point set P 2 PiS) is (arboreally) 
satisfied iff the following holds: 
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- For each pair p,q e P that are both active from time p, to q, (called an active 
pair), either both p and q lie in the same vertical/horizontal line, or there is a point 
r e \I\ p qnP\{p, q}. If r is on the bottommost row ofO pq , then r cannot be a deletion 
point. Ifr is on the topmost row ofO pq , then r cannot be an insertion point. 

- For each update point p € U, if both pred(p) and succ(p) exist, then either pred(p) 
or succ(p) is also in P. 

The first condition is almost the same as the one in (3;, Def. 2.3] but focused only on 
active pairs (they are active from p, to q t ), and with additional technical condition due 
to update points. The second condition says that if the updated element is not the current 
minimum/maximum, then one of its adjacent elements must be touched. 

Note that if there are no update points, then all points are active the whole time and 
our definition is equivalent to [j3: Def. 2.3]. We defer the proof of the following fact to 
the appendix. 

Fact 12 Suppose that P is satisfied. Then, for each pair p,q e P which are both active 
from time p, to q t and p, < q t , there exists a point in P \ { p , q) on a side ofO pq incident 
to p, that is either a non-deletion point, or the corner (p x ,q,). Similarly, there exists a 
point in P \ {p, q) on a side ofO pq incident to q, that is either a non-insertion point, or 
the corner (q x , p t ). 


3 Equivalence of Arboreal and Geometric Views 

In this section we prove the following theorem: 

Theorem 13. A point set P is satisfied iff P = P(E) for some BST execution E. 

The first direction of the proof involves considering a BST algorithm and showing 
that it generates a satisfied point set (tree to geometry). The second direction is showing 
how to convert a satisfied point set to a BST algorithm (geometry to tree). 


3.1 Tree to Geometry 

Lemma 14. Let x and z be elements with consecutive values in a BST T, with x < z. 
Then one of x and z. is an ancestor of the other. 

Proof. Suppose not. Then the lowest common ancestor of x and z is another element y. 
We know x <y < z which is a contradiction. □ 

Lemma 15. Suppose that y is not the minimum or maximum element in a BST T. To 
insert or delete y in T, either pred T (y ) or succriy) must be touched. 

Lemma 16. For any execution E, a point set P(E) is satisfied. 

Proof. There are two conditions that need to be checked. 

For the first condition, let p, q be a pair of points in P(E) active from time p, to q,. 
Suppose that p, q violate the condition. Hence, they are not vertically or horizontally 
aligned. We assume that p, < q, and p x < q x . Since p x and q x are active at time p t , 
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by Fact [5] and the statement below the fact, they exist in the tree T p . Hence, a lowest 
common ancestor a of p x and q x in T Pt is well-defined. There are two cases. 

If a — p x , then p x is an ancestor of q x . Since O pq is not satisfied, q x is not touched 
from time p t to q, - 1 and p x remains an ancestor of q x right before time q,. Thus, to 
touch q x at time q u p x must be touched, and so (p x , q,) e O pq . Only insertion point can 
be in the topmost row of unsatisfied O pq . So (p x , q,) an insertion point. But this implies 
that p and q are not active pair, which is a contradiction. 

If a + p x , then a must be touched at time p t . As a has value between p x and q x , we 
have ( a,p t ) e O pq . Since \H pq is not satisfied, ( a,p t ) is a deletion point and, moreover, 
p x must be its predecessor. Hence p x becomes an ancestor of q x right after time p, and 
we can use the previous argument again. 

For the second condition, suppose that p e U is an update point. That is, we update 
p x in the BST T Pt . If both pred(p) and succ(p) exist, then p x is not a minimum or 
maximum in T Pl . By Lemma 15 either pred T (p x ) or succt p (p x ) is touched at time p,. 
By Lemma[8j pred Tp (p x ) = pred(p) and succr pi (p x ) = succ(p), and we are done. □ 


3.2 Geometry to Tree 

Now we show how to convert a valid point set to an offline algorithm first. We need the 
following lemma, which is essentially a converse of Lemma [15] saying that if we touch 
either pred T (y ) or succjiy), then we can insert or delete y. We defer the proofs of the 
following two statements to the appendix. 

Lemma 17. Suppose either pred T (y) or succjiy ) is in a subtree t containing the root of 
T, or y is the minimum or maximum element in T. Then (i) any reconfiguration t —> P , 
where t' — tU{v) as a set, is valid, and(ii) any reconfiguration t —* t', where t = r'Ufy} 
as a set, is valid. 


Lemma 18 (Offline Equivalence). For any satisfied set X, there is a point set P(E) = X 
for some execution E. We call E a tree view ofX. 


By Lemma 16 and 18 this concludes the proof of Theorem 13 


Observe that if X = P(E), the quantity |X| is exactly the execution cost of E. 


3.3 Geometry to Tree: Online 

The discussion in § |3.2| assumes that a satisfied set X is available all at once, and we 
show that there exists an execution E (i.e. an offline BST algorithm) whose point set 
P(E) is exactly X. 

We call an online geometric algorithm an algorithm that, given a geometric update 
sequence P(S) c [n] X [m], outputs a satisfied superset P 2 P(S), with the condition 
that both the input and output are revealed row-by-row (i.e. the decision on which points 
to touch can depend only on the current and preceding rows of the input). We remark 
that Greedy BST (as extended in §[4| is such an algorithm. 

Analogously, by an online BST algorithm we mean a procedure that, given an initial 
set So £ [n], and an update sequence S , outputs an execution E, with the condition that 
both the input and output are revealed item-by-item (i.e. the decision on which recon¬ 
figuration to perform can depend only on the current and preceding update operations). 
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Theorem 19 (Online Equivalence). For any online geometric algorithm !B, there ex¬ 
ists an online BST algorithm FB! such that, on any update sequence, the cost of IB' is 
bounded by a constant times the cost of IB. 


The proof of Theorem 
new geometric setting, and 
in this extended abstract. 


19 is an adaptation of the proof of Lemma 2.3 in 0 to the 


is analogous to the proof of Lemma 18 We omit the proof 


4 Defining Greedy BST with Insertion/Deletion 

Greedy BST is an online algorithm for constructing a satisfied set given an update se¬ 
quence S. At each time t , Greedy BST minimally satisfies the point set up to time t. 
Having defined satisfied sets when there are update points, we naturally obtain the ex¬ 
tension of Greedy BST that can handle insertions and deletions. 

We develop some notation for describing the al¬ 
gorithm. A rectangle O pq is unsatisfied if there is no 
other point in the proper (closed) rectangle formed 
by points p and q. We say that p and q are an active 
pair if they are active from time p, to q t . The stair of 
point p is denoted by stair(p) = {p} U {q | is un¬ 
satisfied rectangle formed by an active pair p and q 
where q is below p\. The stair of element x at time 
t is the stair of the point (x, t). Satisfying/touching 
stair(x, t ) means visiting/touching, at time f, the el¬ 
ements of points in the stair: {(q x , t) \ q e stair(x, f))- 
These elements are then added to the row at time t. 

Fact 20 Touching the stair stair(p) is to minimally 
satisfy the point p. 

Fig. 3: A Greedy BST execution with insert (O), delete (x), ac¬ 
cess (G), touched (•) points, and touched points at time Therefore, when Greedy BST gets an access point 
p, (♦). Thick line shows stair of p. Observe that a non- p , it touches only stair(p). For an update point 
(min/max) insert or delete must access a neighbor as well. ^ if p j s not t h e minimum or maximum, then 

Greedy BST chooses the smaller set between 
stair(p) U stair(pred(p)) and stair(p) U stair(succ(p)). This is because of the second 
condition of satisfied set. If p is the minimum or maximum, then Greedy BST just 
touches stair(p). The execution of Greedy BST is illustrated in Figure[3] 

The following observation is useful for deque sequences. For insertion point p, ob¬ 
serve that stair(p) = \p) because the active time of p begins at time p, itself (for any 
point q below p , p and q are not an active pair by definition). 

Fact 21 To insert p such that p is the minimum or maximum, Greedy BST touches only 
P- 

5 Performance of Greedy BST on Deque Sequences 

Definition 22 (Deque Sequence). An update sequence is a deque sequence if it has 
only insertions and deletions at the current minimum or maximum element, and no 
access operations. 
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Definition 23 (Output-restricted). A deque sequence is output-restricted if it has dele¬ 
tions only at minimum elements. 

Theorem 24. The cost of executing a deque sequence on [n\ of length m by Greedy BST 
is at most 0(m2 a< -' n ’ n+m + n), where a is the inverse Ackermann function. 

Theorem 25. The cost of executing an output-restricted deque sequence on [n\ of length 
m by Greedy BST is at most 24 m + 12 n. 



Fig. 4: Sample execution of Greedy BST on a 
concentrated deque sequence with insert 
(O), delete (x), and touched (®) points. 
Dashed lines show the active times of el¬ 
ements. 


Remark. The bound in Theorem [25] refers to the cost of 
the online geometric Greedy BST. In the online tree-view 
equivalent the constants can be larger, hinging on the de¬ 
tails of Theorem [19] but the bound remains of the form 
0(m + n). 

The rest of this section is devoted to the proofs of The¬ 
orems [24] and [25] 

5.1 Concentrated Deque Sequences 

We first reduce the analysis of Greedy BST on any deque 
sequence to that on a special type of deque sequence that 
we call a concentrated deque sequence. Recall that in a 
deque sequence we can delete only the current minimum 
or maximum. We define two sets of elements as follows: let 
L, be the set of elements which are deleted (from the left) 
before time t when they were the minimum at their deletion 
time, and R, be the set of elements which are deleted (from 
the right) before time t when they were not the minimum at 
their deletion time. Observe that L, HR, — 0. 


Definition 26 (Concentrated Deque Sequence). A deque sequence is concentrated if 
for any time t, if the inserted element x is the minimum, then y < x for all y 6 L t , and if 
x is the maximum, then x < y for all y e R,. 

Note that the definition implies that each element in a concentrated deque sequence 
can be inserted and deleted at most once. We defer the proof of the following lemma to 
the appendix. 


Lemma 27. For any deque sequence S, there is a concentrated deque sequence S' such 
that the execution of any BST algorithm on S' and S have the same cost. 


5.2 Greedy BST on a Concentrated Deque Sequence 

Now we analyze the performance of Greedy BST on concentrated deque sequences 
(see Figure[4]for an example). Because of Lemma |27] we can view the points touched 
by Greedy BST as an (m x (n + m)) binary matrix (i.e. with entries 0 and 1), with all 
touched points represented as ones, and all other grid elements as zeroes. Notice that the 
number of columns is n + m instead of n because of the reduction in Lemma [271 which 
allows each element to be inserted and deleted at most once. We further observe that 
if a deque sequence is output-restricted, then the transformation of Lemma [27] yields a 
concentrated deque sequences that is similarly output-restricted. 
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Definition 28 (Forbidden Pattern). A binary matrix M is said to avoid a binary matrix 
P (called a pattern) if there exists no submatrix M' of M with same dimensions as P, 
such that for all l-entries of P, the corresponding entry in M' is 1 (the 0-entries of P 
are “don’t care” values). 

We denote by Ex(P, m, n) the largest number of Is in an (mxn) matrix M that avoids 
pattern P. In this work, we refer to the following patterns (as customary, we write dots 
for 1-entries and empty spaces for 0-entries). 


\ 



and P 4 = 


Lemma 29. The execution of Greedy BST on concentrated deque sequences avoids 
the pattern P$. 

Proof. Suppose that If appears in the Greedy BST execution, and name the touched 
points matched to the 1-entries in P 5 from left to right as a, b, c, d. and e. 

Let t > b, be smallest such that (c x ,t) is touched. Then t < c t and either h or d 
must have been deleted within the time interval [b,,t]. Otherwise, any update point in 
the interval [b t , t] is outside the interval [b x , d x ] and c x is “hidden” by b and d (it cannot 
be on the stair of any update point). 

Assume w.l.o.g. that b is deleted. If b is deleted by a minimum-delete, then a cannot 
be touched. If b is deleted by a maximum-delete, then e cannot be touched. This is 
because the sequence is concentrated. □ 

Lemma 30. The execution of Greedy BST on concentrated output-restricted deque se¬ 
quences avoids the pattern P 4 . 

Proof: Suppose that P 4 appears in the Greedy BST execution, and name the touched 
points matched to the 1-entries in P 4 from left to right as a , b. c, and d. We claim that 
in order to touch c, there has to be a deletion point in the interval [b x , d x \ in the time 
interval [d t ,Ct\. Otherwise, any deletion point in the time interval [d t , c,] is left of b x 
(as deletes happen only at the minimum). Furthermore, all insertion points in the time 
interval [b t , c t ] must be outside of [b x , d x ] (since both b and d are active at time b,). We 
remind that insertion touches nothing else besides the insertion point itself. This means 
that c cannot be touched: it is “hidden” to deletion points on the left of b x by b. 

Denote the deletion point in the rectangle [b x ,d x ] x [d t , c f ] as d'. Observe that a is 
to the left of and above d ', and since we only delete minima, a is not active at time d\. 
In order to be touched, a must become active after d\ via an insertion, contradicting that 
the sequence is concentrated. □ 


Fact 31 ([12, Thm 3.4]). E x(P 5 ,u,v) = 0(u2 aM + v). 
Fact 32 ((15 Thm 1.5(5)]). Ex(B 4 , u, v) < 12 (u + v). 


10 




Proof of Theorem \24\ By Lemma [27] it is enough to analyze the cost of Greedy BST 
on concentrated deque sequences. This cost is bounded by 0(m2 a(m ’ m+n ^ + n) using 
Lemma l29l and Factl3T1 


Proof of Theorem |25|- By Lemma [27] it is enough to analyze the cost of Greedy BST 
on concentrated deque sequences. This cost is bounded by 24m + 12 n using Lemma 30 
and Factl32l 


Remark. The proof of Theorem [25] can be minimally adjusted to prove the sequential 
access theorem for Greedy BST. Sequential access can be simulated as a sequence of 
minimum-deletions. In this way we undercount the cost by exactly one touched point 
above each access, which adds a linear term to the bound. 


6 A Lower Bound on Accessing a Set of Permutations 

Let U be a set of permutations on [n]. In this section we prove the following theorem: 

Theorem 33. Fix a BST algorithm PR and a constant e < 1. There exists U c U of size 
\U'\ > (1 - jjjp)|t/| such that PR requires £2(log \U\ + n ) access cost on any permutation 
in U . 

Proof. The proof utilizes the geometric view of insertions, and uses two reductions. We 
first claim that there exists an algorithm B that is capable of insertions such that the cost 
of PR to access a permutation it is no less than the cost of 'B to insert n. Note that since PR 
is accessing n , all the points are active by definition. We will describe B in the geometric 
view simply by requiring that upon inserting n{t) at time t, B touches all the points that 
PR touches while accessing nit) at time t. Note that PR touches at least all the points in 
stair(n(t), f), and B is required only to touch either pred(n(t)) and its stair, or succ{n{t)) 
and its stair (Definition 11). Since pred(n(t)) belongs to slair(7t{t),t). one easily sees 
that stair(pred(p)) c stair(n(t), f), and this defines a valid insertion algorithm. 

We now reduce PB to an algorithm for sorting n. Just by a traversal of the tree main¬ 
tained by B at time n, we can produce the sorted order of n after incurring a cost of Oin). 
However, we know that to sort a set U of permutations, any (comparison-based) sorting 
algorithm must require £21 log | U\ + n) comparisons on at least al - fraction of the 
permutations in U. To see this, note that the decision tree of any sorting algorithm must 
have at least \U\ leaves (note that here we are assuming the weaker hypothesis that PR 
and hence the sorting algorithm, are only designed to work on U ; they may fail outside 
U). The number of leaves at height at most (1 - e) log \U\ is at most \U\ 1_e , and hence at 
least a 1 - fraction require at least (1 - e)log \U\ = £2(log |(/|) comparisons. Adding 
the trivial bound of Q{n) to scan the input permutation gives us the desired bound. 

Remark. Upper bounds proved for our model do not directly translate into bounds for 
algorithms. For example, when a new maximum is inserted, this can be done at a cost of 
one by making the element the root of the tree, respectively, only touching the element 
inserted. Note that this requires the promise that the element inserted is actually a new 
maximum. A slight extension makes the model algorithmic. This is best described in 
tree-view. We put all nodes of the tree in in-order into a doubly-linked list. Then, in 
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the case of an insertion one can actually stop the search once the predecessor or the 
successor of the new element has been reached in the search because by also comparing 
the new element with the neighboring list element, one can verify that a node contains 
the predecessor or successor. Thus at the cost of a constant factor, bounds proved for 
our model are algorithmic. □ 

Acknowledgement. We thank an anonymous reviewer for valuable comments. 
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A Proof Omitted from Section |2| 


A.l Proof of Fact ITU 


We give the proof only for p, as it is symmetric for q. 


Since P is satisfied, there is a point r in □ 


'pi 


np\{p, q ) that satisfies D pq . If r is on 
the horizontal side incident to p, then it is not a deletion point, otherwise it would not 
satisfy O pq by the first condition of Definition 11 If r is on the vertical side incident 


to p, and not ( p x ,q t ), then it is not a deletion point, otherwise p and q would not be an 
active pair. Thus, if r is on a side of \3 pq incident to p, we are done. Suppose this is 
not the case, and let r be a point in O pq n P \ {p, q) satisfying O pq such that t ms (r) is 
minimum. We claim that t ins (r) < p,. 

First, we have f, nj (r) < q, because otherwise t lns (r) = q, and hence r is an insertion 
point on the topmost row of O pq , and cannot satisfy D pq due to the first condition of 
Then there must be some other point in O pq C\P\{p,q), whose insertion 
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Definition 

time is before q t , a contradiction. 

Next, suppose t,„ s (r) > p, then the point r’ = {r x , ti ns {r)) is an insertion point, and, 
by the second condition of Definition 11 there is another point r" e O pq which is 
either pred{r’) or succ(r'). Note that r" ± p, q because p, < tj ns (r) < q,. Observe that 
tms{r") < ti ns (r ), contradicting the choice of r. 

Now, since tj ns (r) < p t , p and r are both active from time p, to r t , and we can repeat 
the same argument as long as r is not on the sides of O pq incident to p. 


B Proof Omitted from Section [3] 


B.l Proof of Lemma HSl 

Since y is not the minimum or maximum, both predecessor pred T (y) and successor 
succjiy) of v exist. Let x = pred T (y) and z = succj(y). We consider two cases: insertion 
and deletion. 

For insertion of y, before we insert, y £ T holds. Then x and z are consecutive, and, 
by Lemma [l4| we assume by symmetry that x is an ancestor of z. In particular, x has 
right child. After insertion, x and y are consecutive. So, by Lemma [14} either x or y is 
an ancestor of the other. If x is an ancestor of v, then x must have been touched so that 
we can change pointers below x. Otherwise, y is an ancestor of x and x cannot have a 
right child. Thus, x’s right child pointer must have been changed to null, meaning that 
x has been touched. 

For deletion of y, suppose that we do not touch both x and z. If y has at most one 
child, then one of x and z. is an ancestor of y, but we must touch all the ancestors of y, 
a contradiction. If y has two children, then x has no right child and z has no left child. 
Suppose we delete y without touching x and z. Therefore, x still has no right child and 


z still has no left child even after y £ T. This contradicts Lemma 14 


B .2 Proof of Lemma [17] 

Suppose that x = pred T (y) e r. The proof when succjiy) e t is symmetric. There 
are two statements to be proved regarding insertion and deletion of y, respectively. Let 
child r{x) denote a right child pointer of element x. 
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For insertion, we show that r —> r' where r' = rUjy) is valid. First, we insert y 
into r as a right child of x. The only pointer changes are: child K (y) <— child K (x) and 
childR(x) <— y. Finally, we rotate the resulting subtree, which includes y, to get r'. 

For deletion, we show that r —> r' where r = t'U{v) is valid. Again, we rotate 
r such that y is a right child of x, and then remove y. The only pointer change is: 
child R (x ) <— child R (y). Then we rotate the resulting subtree, which excludes y, to get r'. 


B .3 Proof of Lemma [18] 


We use the almost same argument as in |3j Lemma 2.2] but we need to make sure that 
we can also update elements while touching all points in X exactly. The argument is as 
follows. 

Define the next touch time Nix, f (l ) of x at time to in X to be the minimum t- 
coordinate of any point in X on the ray from (x, to) to (x, oo). If there is no such point, 
then N(x, to) — oo. 

Let T, be the treap defined on all points (x, Nix, t)) active right after time t. Recall 
that a treap is a BST on the first coordinate and a heap on the second. Let X t denote the 
set of elements in the row t of X. Since T, is a treap with heap priority /V(-, t), r, = X t C\T t 
is connected subtree containing the root of T t . So we have t, = X r , and r, = X, \ jyj if 
we insert y at time 1 , as desired. 

If there is an update element y in X t , and pred T (y) and succjiy) exist, by Lemma 17 
we just need to show that either pred T (y) e X, or succt, (y) e X t . Since X is satisfied, 
either pred(y,t) e X, or succ(y,t) e X,, say pred(y,t) e X,. By Fact [8] pred Ti (y) = 
prediy, t). Therefore, t, —> rj is a valid reconfiguration where r, U t' = X t . 

After, we update y in r, and get t\, we want to get T t+ \ which is a treap defined on 
N(-,t + 1). To get this, we just heapify rj based on N(-, t + 1). We claim that the whole 
tree is now T t+ 1. The following argument is exactly same as in |3] Lemma 2.2]. Suppose 
there is a parent/child (q, r) that heap property does not hold. Both q, r cannot be in r] 
by construction. The next touch time of elements outside rj does not change, so both 
q, r cannot be outside rj. 

Now, we have q e r,' and r <£ t[ where N(r,t + 1) < N(q,t +1). The rectangle 
defined from (q, t) and ( r,N{r,t +1)) will contradict Fact[l2] There are two sides to 
be considered. First, there is no point on the vertical side ((q, t), (q, Nir , t + 1)] because 
N(r , t + 1) < N(q, t + 1). Next, all elements in T t+ \ between q and r must be descendants 
of r, and they cannot be touched as r is not touched at time t. So the horizontal side 
i(q, t), (r, f)] can only have one deletion point s which has q x as a predecessor/successor. 


This violates Fact 12 and completes the proof. 


C Proof Omitted from Section [5] 

C.l Proof of Lemma l27l 

Suppose that S is not concentrated, and let to be the first time when S violates this 
condition. We modify the sequence and obtain another sequence S ' such that the first 
violation time is later than to, and the executions of S and S ’ on any BST algorithm are 
the same, and repeat the argument. 
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So, assume w.l.o.g. that the element x is inserted as the minimum at time f (l . Since 
the condition is violated, x < y for some y e L t . Let x' be an element such that y < x', 
for all y e L to , and xf is less than all elements in the current tree T, 0 . Note that xf must 
exist, because there is no violation before time fo- 

Now, since BST is a comparison-based model, as long as the relative values of all 
following update elements are preserved, even when the sequence is modified, the BST 
algorithm would behave the same. 

Therefore, we modify S such that we set the value of x to be x' while preserving the 
relative values of all following update elements. So now the condition is not violated at 
time to while the execution of the modified sequence is unchanged. 
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